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Abstract 

We provide simple proofs describing the behavior of the largest 
component of the Erdos-Renyi random graph G(n,p) outside of the 
scaling window, p = where e(n) — > but e(n)n 1 / 3 — > oo. 

1 Introduction 

Consider the random graph G(n,p) obtained from the complete graph on 
n vertices by retaining each edge with probability p and deleting each edge 
with probability 1 — p. We denote by Cj the j-th largest component. Let 
e(n) be a non-negative sequence such that e(n) — > and e(n)n 1//3 — > oo. The 
following theorems, proved by Bollobas [4] and Luczak [8] using different 
methods, describe the behavior of the largest components when p is outside 
the "scaling-window" . 

Theorem 1 [Subcritical phase] If p(n) = — then for any rj > and 

integer £ > we have 



2e(n) 2 log(ne(n) 3 ) 



> rj 



as n — > cxo. 



Theorem 2 [Supercritical phase] // = 1+ ^ n ' ) t/ien /or any n > 
we /iaue 

|Ci| 



2ne(n) 
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and for any integer i > 1 we have 

\CA 



2e 2 (n)log(ne 3 ) 



> n 



as n — > oo. 



The proofs of these theorems in [1] and [8] are quite involved and use 
the detailed asymptotics from p3], [3] and [3] for the number of graphs on 
k vertices with k + t edges. The proofs we present here are simple and 
require no hard theorems. The main advantage, however, of these proofs 
is their robustness. In a companion paper [12] we use similar methods to 
analyze critical percolation on a random regular graphs. In this case, the 
enumerative methods employed in [4] and [8] are not available. 

The phase transition in the Erdos-Renyi random graphs G(n,p) happens 
when p= j-. Namely with high probability, if c > 1 then \C\\ is linear in n, 
and if c < 1 then \C\\ is logarithmic in n. When c ~ 1 the situation is more 
delicate. In [9], Luczak, Pittel and Wierman prove that for p = 1+A ^ — -, 
the law of n _2//3 |Ci| converges to a positive non-constant distribution which 
in [1] is identified as the longest excursion length of some Brownian motion 
with variable drift. See [TTJ for a recent account of the case p = 1/3 
with simple proofs. 

Thus, \C\\ is not concentrated and is roughly of size n 2 / 3 if p = 1+A ^ — — . 
However, if e(n) a sequence such that n 1//3 e(n) — * oo and p = 1+ ^ n - ) then as 
stated in Theorems Q] and [21 the size \C\\ of the largest component in G(n,p) 
is concentrated. In summary G(n,p) has a scaling window of length n -1 / 3 in 
which the percolation is "critical" in the sense that \C\\ is not concentrated. 



2 The exploration process 

We recall an exploration process, due to Karp and Martin-L6f (see [7] and 
|10j). in which vertices will be either active, explored or neutral. After the 
completion of step t E {0, 1, . . . , n} we will have precisely t explored vertices 
and the number of the active and neutral vertices is denoted by At and Nt 
respectively. 

Fix an ordering of the vertices {v%, . . . , v n }. In step t = of the process, 
we declare vertex v\ active and all other vertices neutral. Thus Aq = 1 and 
iVo = n — 1. In step t £ {1, . . . ,n}, if > let wt be the first active 
vertex; if At-i = 0, let wt be the first neutral vertex. Denote by rjt the 
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number of neutral neighbors of wt in G(n,p), and change the status of these 
vertices to active. Then, set wt itself explored. 

Denote by Tt the cr-algebra generated by {rji, . . . ,r/ t }. Observe that 
given Tt-\ the random variable r\ t is distributed as Bin(A r i _i — l{A t _i=o})P) 
and we have the recursions 

N t = N t - X -vt~ l{A t _ 1= o} * < n , (1) 

and 



A, 



At-i +Vt-h ^ > 
rj t , A t -! = 0, t<n. 

As every vertex is either neutral, active or explored, 



N t = n - t - A t , t < n . (3) 

At each time j < n in which Aj = 0, we have finished exploring a 
connected component. Hence the random variable Zf defined by 

t-l 

z t = Yl 1 {^=o> ' 

counts the number of components completely explored by the process before 
time t. Define the process {Yt} by lo = 1 an d 

Y t = Y t -! + nt - 1 . 

By ([2]) we have that Yt = At — Zt, i.e. Yt counts the number of active vertices 
at step t minus the number of components completely explored before step 
t. 

At each step we marked as explored precisely one vertex. Hence, the 
component of v\ has size min{t > 1 : At = 0}. Moreover, let t\ < t2 ■ ■ ■ 
be the times at which At - = 0; then (^i, ^2 — *3 — ^2, • • •) are the sizes of 
the components. Observe that Z% = Z-t- + 1 for all t £ {tj + 1, . . . 
Thus Y tj+1 = Y tj — 1 and if t G {tj + 1, . . . , tj+i — 1} then A t > 0, and thus 
^tj+i < Y t . By induction we conclude that A t = if and only if Y t < Y s 
for all s < t, i.e. At = if and only if {Yt} has hit a new record minimum 
at time t. By induction we also observe that Yt- = — (j ' — 1) and that for 
t £ {tj + we have Z t = j. Also, by our previous discussion 

for t £ {tj + 1, . . . tj + i} we have min s < t _i Y t = Y tj = — (j — 1), hence by 
induction we deduce that Z t = — mm s < t -i Y t + 1. Consequently, 

A t = Y t - min Y. + l. (4) 

s<t-l 
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Lemma 3 For all p < § there exists a constant c > such that for any 
integer t > 0, 

P (Nt < n - 5tj < e~ ct . 

Proof. Let {cti}\ =l be a sequence of i.i.d. random variables distributed 
as Bin(n,p). It is clear that we can couple rji and cti so rji < an for all i, and 
thus by CO) 

t 

N t >n-l-t-^2ai. (5) 

i=l 

The sum Yl\=i a i ^ s distributed as Bin(nt,p) and p < ^ so by Large Devia- 
tions (see j'2] section A. 14) we get that for some fixed c > 

t 

P(^a 4 >3t) <e- ct , 

i=l 

which together with ([5]) concludes the proof. □ 

3 The subcritical phase 

Before beginning the proof of Theorem [1] we require some facts about pro- 
cesses with i.i.d. increments. Fix some small e > and let p = for some 
integer m > 1. Let {f3j} be a sequence of random variables distributed as 
Bin(m,p). Let {Wtjt^o be a process defined by 

Wq = 1, W t = W t . x + (3 t - 1 . 

Let r be the hitting time of 0, 

t = min{W( = 0} . 

By Wald's lemma we have that Er = e . Further information on the 
tail distribution of r is given by the following lemma. 

Lemma 4 There exists constant C\, C2, c\, C2 > such that for all T > e~ 2 
we have 

/ o 0/0 (e 2 -cie 3 )T 

P(r > T) < Ci(e- 2 r- 3 / 2 e- i 2 



anc? 

P(t > T) > ci(V 2 r~ 3/2 £ 

Furthermore, 

Er 2 = 0(e~ 3 ) 
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We will use the following proposition due to Spitzer (see |13j). 



Proposition 5 Let ao, . . . , Ofc-i £ Z satisfy ^* = o a * = — 1- Then there is 
precisely one j E {0, . . . , k — 1} smc/i i/ia£ /or all r £ {0, . . . , k — 2} 

r 

^ a 0'+i) mod fc > . 
i=0 

Proof of Lemma [H By Proposition [3 P(r = t) = \P(W t = 0). As 
Y^j=i Pj 1S distributed as a Bin(mt,p) random variable we have 

p(w t = o) = r^j^Hi - P ) m -(*-^ . 

Replacing t— 1 with i in the above formula only changes it by a multiplicative 
constant which is always between 1/2 and 2. A straightforward computation 
using Stirling's approximation gives 



PW = o, = e{rv»(i - e) .(, + -l T )'"(i - . w 

Denote q = (l- e )(l + _L_) (l-^) , then 

-;/2 q t 



p(r > T) = p(t = t) = £ Jp(w* = o) = e(j> 

This sum can be bounded above by 

rp-3/2 t _ rp-3/2 1 

1 — Q ' 



and below by 



2T rp rp. 



Observe that as m — > oo we have that g tends to (1 — e)e e . By expanding e e 
we find that 

= (i- e) (i + e + ^) + e(e 3 ) = i-^ + e(6 3 ). 
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Using this and the previous bounds on P(r > T) we get the first assertion 
of the Lemma. 

The second assertion follows from the following computation. By ([6]) we 
have that for some constant C > 

Er 2 = ^t 2 P(r = t) = Y J tV{W t = 0) < Cj^ vV • 

t>l t>i t>l 

Thus, by direct computation (or by [6], section XIII. 5, Theorem 5) 

Er 2 <0(— ) 3/2 = 0(e- 3 ). 
VI — qJ 

□ 



Proof of Theorem [TJ We begin with an upper bound. Recall that com- 
ponent sizes are tj+i — tj for some j > where tj are record minima of the 
process {Yt}. For a vertex t> denote by C(v) the connected component of 
G(n,p) which contains v. We first bound P(|C(vi)| > T) where 

T = 2(1 + 7/)e- 2 log(ne 3 ) . 

Recall that |C(t>i)| = mhn.{Y t = 0}. Couple {Y t } with a process {Wt} as 
in Lemma HI which has increments distributed as Bin(n,p) — 1 such that 
Yt < Wt for all t. Define r as in Lemma [H As p = ^—^ and T > e~ 2 , by 
Lemma d] we have 

P(r > T) < CeK 3 )-^ log(ne 3 )- 3/2 , 

for some fixed C > 0. Our coupling implies that P(|C(i>i)| > T) < P(r > 
T). Denote by X the number of vertices v such that |C(t>)| > T. If |Ci| > T 
then X > T. Also, for any two vertices t> and u by symmetry we have that 
|C(u)| and |C(«)| are identically distributed. We conclude that 



P(| Cl |>T) < P ( X>T),M ^^yi>T) 

g 1 n g (ne 3 )-( 1 +^( 1 -^-)log- 3 / 2 (n e 3 ) f .._, (1 _ fte)+fta _ 
2(1 + r/)e- 2 log(ne 3 ) " l ; 

We now turn to prove a lower bound. Write 

T = 2(1 - r ? )e" 2 log(ne 3 ) , 
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and define the stopping time 

nen 

7 = mm{i : Nt < n } . 

8 

Recall that {tj} are times in which Af. = and also Yt- is a record minimum 
for {Yt}. For each integer j let {W^} be a process with increments dis- 
tributed as Bin(n — where the starting point is Wq^ = Yt j = — 

Note that if tj + \ < 7 then we can couple {Yt} and {W f } such that 
Ytj+t > for all t G [tj, ij+i]- Define the stopping times {t^} by 



Tj = min{t : = -j} . 



Take 



N 



We will prove that with high probability tjy < 7 and that there exists 
k\ < h% < . . . < kg < N such that 77- > T. Note that these two events 
imply that \Cg\ > T. Indeed, by Lemma [3] we have 

Pf7<^)<e— . (7) 



40 ) 

By bounding the increments of {Yt} above by variables distributed as Bin(n,p)- 
1 we learn by Wald's Lemma (see [5|) that E [ij+i — tj] < e _1 , hence 
~Et N < e" 2 (ne 3 ) (1 "f). We conclude that 

rjen, ^ 40e- 2 (ne 3 )( 1_ i) 40. ,._ 2 

P (tN > < - '- = — (ne 3 )"8 , (8) 

40 rjen n 

which goes to as en -1 / 3 tends to 00. In Lemma [J] take m = n — and 
note that p = — £ ^ — — > ^j-Lalf ( anc [ so Lemma U] gives that for any j 

P(tj >T)> c 1 e(ne 3 )-( 1+ l) 2 ( 1 -^( 1+C2e )log- 3 / 2 (e 3 n) > e(ne 3 )-^ . 

Let X be the number of j < JV such that r» > T. Then we have 

EX > Ne(ne 3 )- {1 -^ > C(ne 3 )f -> 00 , 

hence by Large Deviations (see [2], section A. 14) for any fixed integer i > 
we have 

P(x <£) < e~ c( - ne3 ^ , 
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for some fixed c > 0. By our previous discussion, this together with ([7]) and 
© gives 

P(|Q|<T)<0(^). 

□ 



4 The supercritical phase 

In this section we denote £t = rjt — 1. We first prove some Lemmas. 
Lemma 6 If p = ±^ then for all t < 3e(n)n 

BA t = 0(et + Vt), (9) 

and 

BZ t = 0{et + Vi). (10) 

Proof. Write T = 3en. We will use dH). First observe that as rjt can 
always be bounded above by a Bin(n,p) random variable we can bound 
E£i < e for all £ and hence EYj < et. Denote by r the stopping time 
r = min{t : iVf < n — 15en}. By definition of ?yt we have 

E [6 | ^t-l] = P^t-l " Pl{A t „ 1= 0} - 1 • 

As {Nt} is a decreasing sequence, we deduce that as long as t < r, we 
have E [£ t | > -De for Z) > large enough. Hence, the process 

{Dej—YjYj^Q is a submartingale for any t. By Doob's maximal I? inequality 
we have 

E [ max (Dej - Yj) 2 } < 4E \(De(t At) — Y tAr ) 2 } . (11) 

j<t/\T 

For any j < r the random variable rjj can be stochastically bounded 
from below by a Bin(n — 15en,p) random variable and above by a Bin(n,p) 
random variable. Hence for any k < j < t we have 

Efo-£>e|.F fc ]| = 0(e), 

and so 

E[(fc-£>e)(&- J De)] = 0(e 2 ). 
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We conclude that as long as t < r 

t t 
E[(Det-Y t ) 2 } < 2^Em-De)^ k -De)]+Y,^m-De) 2 } = 0(e 2 t 2 +t). 

k<j j=l 

Lemma [3] implies that for n large enough, 



P(N T < n- 15en) < e~ 3cen < , (12) 



1 

n 2 

and as {N t } is a decreasing sequence we deduce that P(r < T) < n~ 2 . 
Hence for any t <T 



E[(Det-Y t ) 2 } < E[(De(tAT)-Y tAT ) 2 l {t<T} } + 0(n 2 )P(t>T) 
= 0{e 2 t 2 + t). 



We deduce by (|TTj) and Jensen inequality that for any t <T 



E [mm(Yj - Dej)] = 0{et + Vt) , 



hence E [min 3 <t Yj] = 0(et + s/i) and so by Q we obtain (|9|). Inequality 
(|10p follows immediately from the relation Zt = -At — Yf. □ 



Lemma 7 If p = then for all t < 3e(n)n 

EN t = n{l - pY + 0{e 2 n) , (13) 

and 

E6 = e--+0(e 2 ). (14) 
n 

PROOF. Observe that by ([1]) we have that 

E [N t | F t -i] = (1 - p)iV t _! - (1 - p)l { A t _ 1= o} • 

By iterating this relation we get that ENt = n(l — p) + 0(EZt) which 
by Lemma [6] yields (fTB"j) (observe that for t = 3en we have et > yt by our 
assumption on e). Since 

I ^t-l] = pNt-i - pl{A t _ 1= o} - 1 , 
by taking expectations and using ([TBI we get 



9 



E6 



(l + e )(l_i±£)*_l + 0(e 5 



n 



= (1 + e)(l - (1 + e)t/n) - 1 + 0(e 2 ) = e - - + 0(e 2 ) , 

n 

where we used the fact that (1 — x) 1 = 1 — tx + 0(t 2 x 2 ). □ 
Proof of Theorem [2J Write T = 3en and = E \ Fj-i]. The process 

t 

'-./ 



3=1 



is a martingale. By Doob's maximal L 2 inequality we have that 

E(maxM ( 2 )) < 4E M|. 

As Mt has orthogonal increments with bounded second moment we conclude 
that EMj, = 0(T), hence, by Jensen's inequality we have 



E 



max 

t<T 



t 



(15) 



3=1 



As £j = pNj-i - pl {Aj _ 1=0} - 1 by © we have 

E \Cj - E&| =pE \A^x + 1 {Aj . 1= o} ~ E A,-_i - E 1 {Aj _ 1= o}I • 
By the triangle inequality and Lemma [6] we conclude that for all j < T 

E|£-E&| < P -0(ej + ^/j), 
and hence for any t <T 



E 



[EG 



E6 



< p • 0(et 2 + t 3 / 2 ) < 0(e 3 



3<t 



By the triangle inequality we get 



E 



(16) 



3=1 



Using the triangle inequality, (I15p . (I16p and Markov inequality gives 
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z 

P(nmx Ft- J^Efj >5e 2 'nj<5- l (0(e)+0{{e 3 n)- 1 / 2 )) — >0. (17) 



3=1 

Lemma [7] implies that for any b > 



6en 6en ,2 

E E ^ = E ( e - - + °( e2 )) = ( 6 - 7)^ + 0(e3n) • (18) 

By (|17|) and (|18|) we deduce that for 77 > small enough, with probability 
tending to 1, the process Y t is strictly positive at times [rjen, (2 — 77) en] and 
hence 



P 



> 2(1 - 7?)en) > 1 - o{rf x (t + (e 3 n)~ 1/2 )) 



We also deduce by (fTTl) and (fT8j) that at time f = (2 + 77) en we have Yt < 
— ^-e 2 n and at all times i < 77en we have that Yt > — ^-e 2 n with probability 
tending to 1. As component sizes are excursion lengths of Yt above its past 
minima, we conclude that by time 2(1 + 77) en we have explored completely 
at least one component of size at least 2(1 — 77) en. As iVj < n — t for 
t > 2(1 — r])en and 77 < 1/4 we have E [rjt — 1 | Ft-i] < — § • By optional 
stopping we immediately get that if tj > 2(1 — r])en then E [tj+i—tj] < 2e~ 1 . 
Thus if C is a component which we began discovering after time 2(1 — 77)en 
we have 

P(|C| > en) < . 

e z n 

Denote by X(rj) the number of vertices of which |C(u)| > en which we 
began discovering after time 2(1 — r/)en, then we learn that EI())) < 
Denote by Ci(?7) the largest component which we began discovering after 
time 2(1 — 7y)en. Clearly if |Ci(??)| > en then ^(77) > en, thus by Markov 
inequality 

P(|C 1 (7 ? )|>en)<^-^0. 

e J n 

Thus we have proved that there exists a unique component of size between 
2(1 — ?7)en and 2(1 + ?7)en. Condition on this event and consider the graph 
remained on the complement of this component. This graph has m vertices 
where 

\m — (n — 2en)| < 2??en , 
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and as p = we have that 



V mJ 



2ne + 0(e 2 ) 



rn 



This graph is distributed as G(m,p) restricted to the event that it does 
not contain a component of size between 2(1 — rj)en and 2(1 + rj)en. By 
Theorem Q] we know that the event that there exists such a component has 
probability o(l). Thus for any collection of graphs B on m vertices which 
does not contain such a component, the probability of B in the remaining 
graph is (1 + o(l))P m)P (/3) where P m>p is the usual G(m,p) probability 
measure. Thus, we conclude by Theorem [1] that for any integer I > 1 and 



Remark. With a little more effort it is possible to show for the super- 
critical case, that in the exploration process for any fixed £, the £-th largest 
component is discovered after the largest component is discovered. 
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